Provenance in Databases: Why, How, and Where
Authors
Abstract
Different notions of provenance for database queries have been proposed and studied in the past few years. In this article, we detail three main notions of database provenance and some of their applications, and compare and contrast them. Specifically, we review why, how, and where provenance, describe the relationships among these notions, and describe some of their applications in confidence computation, view maintenance and update, debugging, and annotation propagation.
Similar resources
Improv: Flexible Data Provenance for Relational Databases
Curated databases, which consist of data extracted from original sources, printed articles, and other databases, are a valuable source of data for scientists. However, as curated databases aggregate information from multiple sources, the origin of the data elements can be lost. Because of this, curated databases often provide support for data annotations, which are pieces of extra information a...
Why and Where: A Characterization of Data Provenance
With the proliferation of database views and curated databases, the issue of data provenance (where a piece of data came from and the process by which it arrived in the database) is becoming increasingly important, especially in scientific databases where understanding provenance is crucial to the accuracy and currency of data. In this paper we describe an approach to computing provenance when...
Integrating Approximate Summarization with Provenance Capture
How to use provenance to explain why a query returns a result or why a result is missing has been studied extensively. Recently, we have demonstrated how to uniformly answer these types of provenance questions for first-order queries with negation and have presented an implementation of this approach in our PUG (Provenance Unification through Graphs) system. However, for realistically sized data...
On Answering Why-Not Queries Against Scientific Workflow Provenance
Why-not queries help scientists understand why a given data item was not returned by the executions of a given workflow. While answering such queries has been investigated for relational databases, there is only one proposal in this area for workflow provenance, viz. the Why-Not algorithm. This algorithm makes the assumption that the modules implementing the steps of the workflow preserve the attr...
GProM - A Swiss Army Knife for Your Provenance Needs
We present an overview of GProM, a generic provenance middleware for relational databases. The system supports diverse provenance and annotation management tasks through query instrumentation, i.e., compiling a declarative frontend language with provenance-specific features into the query language of a backend database system. In addition to introducing GProM, we also discuss research contribut...
Journal:
- Foundations and Trends in Databases
Volume 1
Publication date: 2009